Global Predicate Detection in Distributed Systems with Small Faults
نویسندگان
چکیده
We study the problem of global predicate detection in presence of permanent and transient failures. We term the transient failures as small faults. We show that it is impossible to detect predicates in an asynchronous distributed system prone to small faults even if nodes are equipped with a powerful device known as failure detector sequencer (denoted by Σ). To redress this impossibility, we introduce a theoretical device, known as a small fault sequencer (denoted by ΣSF ), and show that ΣSF is necessary and sufficient for predicate detection. Unfortunately, we also show that ΣSF cannot be implemented even in a synchronous distributed system. Fortunately, however, we show that predicate detection can be achieved with high probability in synchronous systems.
منابع مشابه
Software Fault Tolerance of Distributed Programs Using Computation Slicing
Writing correct distributed programs is hard. In spite of extensive testing and debugging, software faults persist even in commercial grade software. Many distributed systems, especially those employed in safety-critical environments, should be able to operate properly even in the presence of software faults. Monitoring the execution of a distributed system, and, on detecting a fault, initiatin...
متن کاملSelf-stabilization of Byzantine Protocols
Awareness of the need for robustness in distributed systems increases as distributed systems become integral parts of day-to-day systems. Self-stabilizing while tolerating ongoing Byzantine faults are wishful properties of a distributed system. Many distributed tasks (e.g. clock synchronization) possess e cient non-stabilizing solutions tolerating Byzantine faults or conversely non-Byzantine bu...
متن کاملEfficient Distributed Detection of Conjunctions of Local Predicates
Global predicate detection is a fundamental problem in distributed systems and nds applications in many domains such as testing and debugging distributed programs. This paper presents two eecient distributed algorithms to detect conjunctive form global predicates in distributed systems. The algorithms detect the rst consistent global state that satisses the predicate even if the predicate is un...
متن کاملConsistent Detection of Global Predicates in Asynchronous Distributed Systems with Crash Failures
A fundamental problem in testing and analysis of distributed systems is detecting whether the global state of a system satisses some predicate throughout a computation. There exist some well-developed methods for detecting predicates in fault-free systems (e.g., the predicate transformers possibly and deenitely of Cooper, Marzullo and Neiger), but for systems running in faulty environments ndin...
متن کاملDeriving distributed algorithms from a general predicate detector
Designing and debugging distributed systems requires the detection of conditions across the entire system. As an illustration, monitoring the status of an application requires detection of termination, and using virtual time requires the periodic calculation of the global virtual time. The Generalized Conjunctive Predicate (GCP) detector ooers a method to derive detection algorithms for these a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2007